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Abstract — The index coding problem has recently attracted 
a significant attention from the research community due to 
its theoretical significance and applications in wireless ad-hoc 
networks. An instance of the index coding problem includes a 
sender that holds a set of information messages X = 
and a set of receivers R, Each receiver p — {x,H) G R needs 
to obtain a message x ^ X and has prior side information 
comprising a subset H of X, The sender uses a noiseless 
communication channel to broadcast encoding of messages in 
X to all clients. The objective is to find an encoding scheme that 
minimizes the number of transmissions required to satisfy the 
receivers' demands with zero error. 

In this paper, we analyze the relation between the index 
coding problem, the more general network coding problem and 
the problem of finding a linear representation of a matroid. In 
particular, we show that any instance of the network coding and 
matroid representation problems can be efficiently reduced to an 
instance of the index coding problem. Our reduction implies that 
many important properties of the network coding and matroid 
representation problems carry over to the index coding problem. 
Specifically, we show that vector linear codes outperform scalar 
linear codes and that vector linear codes are insufficient for 
achieving the optimum number of transmissions. 
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Fig. 1. An instance of the index coding problem with four messages and 
four clients. Each client is represented by a couple (x,H), where x ^ X is the 
packet demanded by the client, and H C X represent its side information. 



I. Introduction 

In recent years there has been a significant interest in 
utiHzing the broadcast nature of wireless signals to improve 
the throughput and reliability of ad-hoc wireless networks. The 
wireless medium allows the sender node to deliver data to sev- 
eral neighboring nodes with a single transmission. Moreover, 
a wireless node can opportunistically listen to the wireless 
channel and store all the obtained packets, including those 
designated for different nodes. As a result, the wireless nodes 
can obtain side information which, in combination with proper 
encoding techniques, can lead to a substantial improvement in 
the performance of the wireless network. 

Several recent studies focused on wireless architectures 
that utilize the broadcast properties of the wireless channel 
by using coding techniques. In particular, [1], [2] proposed 
new architectures, referred to as COPE and MIXIT, in which 
routers mix packets from different information sources to 
increase the overall network throughput. Birk and Kol [3], 
[4] discussed applications of coding techniques in satellite 
networks with caching clients with a low-capacity reverse 
channel [3], [4]. 

The major challenge in the design of opportunistic wireless 
networks is to identify an optimal encoding scheme that 



minimizes the number of transmissions necessary to satisfy 
all client nodes. This can be formulated as the Index Coding 
problem that includes a single sender node s and a set 
of receiver nodes R. The sender has a set of information 
messages X = that need to be delivered to 

the receiver nodes. Each receiver p = (x^H) G R needs to 
obtain a single message x in X and has prior side information 
comprising a subset H C X. The sender can broadcast the 
encoding of messages in X to the receivers through a noiseless 
channel that has a capacity of one message per channel use. 
The objective is to find an optimal encoding scheme, referred 
to as an index code, that satisfies all receiver nodes with the 
minimum number of transmissions. 

With a linear encoding scheme, all messages in X are 
elements of a finite field and all encoding operations are linear 
over that field. Figure [T] depicts an instance of the index coding 
problem that includes a sender with four messages xi, . . . , X4 
and four clients. We assume that each message is an element 
of GF(2^), represented by n bits. Note that the sender can 
satisfy the demands of all clients, in a straightforward way, 
by broadcasting all four messages over the wireless channel. 
The encoding operation achieves a reduction of the number of 
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messages by a factor of two. Indeed, it is sufficient to send just 
two messages xi-\-X2-\-xs and xi +X4 (all operations are over 
GF(2^)) to satisfy the requests of all clients. This example 
demonstrates that by using an efficient encoding scheme, the 
sender can significantly reduce the number of transmissions 
which, in turn, results in a reduction in delay and energy 
consumption. 

The above example utilizes a scalar linear encoding scheme 
that performs coding over the original messages. In a vector 
encoding scheme, each message is divided into a number 
of smaller size messages, referred to as packets. The vector 
encoding scheme combines packets from different messages 
to minimize the number of transmissions. With vector linear 
index coding, all packets are elements of a certain finite field 
F, and each transmitted packet is a linear combination of the 
original packets. For example, consider the instance depicted 
in Figure [T] and suppose that each message , 1 < i < 4 
is divided into two packets, G GF(2^), of size n = 

^. Then, a valid vector-linear solution is comprised of four 
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Related Work 



Witsenhausen [5] considered a related zero-error coding 
problem with side information. He studied the point-to-point 
scenario where a transmitter wants to send a random variable 
X over a noisy channel to a receiver that has prior knowledge 
of another random variable Y jointly distributed with X. The 
objective of this problem is to find an encoding scheme that 
allows the receiver to obtain X with zero error probability. 
He observed that if the transmitter knows the realizations of 
y, then the solution can be found in a straightforward way. 
However, in the case when the transmitter is oblivious to the 
realizations of Y, then the problem becomes much harder. 
Specifically, the minimum communication rate in this case is 
related to the chromatic number of the powers of the channel 
confusion graph. Simony i [6] considered a more general case 
with multiple receivers and showed that the obvious lower 
bound, given by the rate needed by the "weakest" receiver, is 
attainable. Index Coding can be considered as a multi-terminal 
generalization of Witsenhausen 's problem, with non-broadcast 
demands and with the restriction that the side information be 
a subset of the original one. 

The index coding problem has been introduced by Birk 
and Kol [7] and was initially motivated by broadcast satellite 
application^ In particular, they developed several heuristic 
solutions for this problem and proposed protocols for practical 
implementation in satellite networks. Bar-Yossef et al. studied 
the index coding problem from a graph-theoretical perspective 
[7]. They showed that the index coding problem is equivalent 
to finding an algebraic property referred to as the minrank 
of a graph. Finding, the minrank of a graph, however, is an 
intractable problem [8]. Lubetzky and Stav [9] showed that 
non-linear scalar codes have a significant advantage over linear 
ones by constructing a family of instances with an increasing 
gap between the optimal number of transmissions required 

^Reference [7] refers to the index coding problem as Informed Source 
Coding on Demand problem (ISCOD). 



by non-linear and linear codes. Wu et al. [10] studied the 
information-theoretic aspects of the problem with the goal 
of characterizing the admissible rate regioij^ Reference [11] 
analyzed the hardness of approximation of the Index Coding 
problem. References [12] and [13] presented several heuristic 
solutions based on graph coloring and SAT solvers. 

Index Coding can be considered as a special case of the 
Network Coding problem [14]. The network coding technique 
extends the capability of intermediate network nodes by al- 
lowing them to mix packets received from different incoming 
edges. The goal of the network coding problem is to find the 
maximum rate between the source and destination pairs in a 
communication network with limited edge capacities. Initial 
works on the network coding technique focused on estab- 
lishing multicast connections. It was shown in [14] and [15] 
that the capacity of a multicast network, i.e., the maximum 
number of packets that can be sent from the source 5 to a 
set T of terminals per time unit, is equal to the minimum 
capacity of all the cuts that separates the source s from any 
terminal t G T. In a subsequent work, Koetter and Medard 
[16] developed an algebraic framework for network coding 
and investigated linear network codes for directed graphs 
with cycles. This framework was used by Ho et al. [17] to 
show that linear network codes can be efficiently constructed 
through a randomized algorithm. Jaggi et al. [18] proposed 
a deterministic polynomial-time algorithm for finding feasible 
network codes in multicast networks. References [19], [20] 
provide a comprehensive overview of network coding. 

Contributions 

In this paper, we study the relation between the index coding 
problem and the more general network coding problem. In 
particular, we establish a reduction that maps any instance of 
the network coding problem to a corresponding instance of 
the index coding problem. We show that several important 
properties of the network coding problem carry over to index 
coding. Specifically, by applying our reduction to the network 
presented in [21], we show that vector linear solutions are 
suboptimal. We also present an instance of the index coding 
problem in which splitting a message into two packets yields a 
smaller number of transmissions than a scalar linear solution. 

We also study the relation between the index coding prob- 
lem and matroid theory. In particular, we present a reduction 
that maps any matroid to an instance of the index coding 
problem such that the problem has a special optimal vector 
linear code that we call perfect index code if and only if 
the matroid has a multilinear representation. This construction 
constitutes a means to apply numerous results in the rich field 
of matroid theory to index coding, and, in turn, to the network 
coding problem. Using results on the non-Pappus matroid, we 
give another example where vector linear codes outperform 
scalar linear codes. 

The rest of the paper is organized as follows. In Section [n| 
we discuss our model and formulate the index and network 



coding problems. In Section IV we present a reduction from 



^Reference [10] refers to the Index Coding problem as the Local Mixing 
Problem. 
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the network coding problem to the index coding problem. In 



Section IV we discuss the relation between the index coding 
problem and matroid theory. In Section |Vj we apply our 
reductions to show the sub-optimality of linear and scalar 
index codes. Next, in Section |Vl] we discuss the relationship 
between networks and matroids and introduce a new family 
of networks with interesting properties. Finally, conclusions 



appear in Section VII 



II. Model 

In this section, we present a formulation of the network 
coding and index coding problems. 

A. Index Coding 

An instance of the index coding problem R) includes 

1) A set of k messages X = {xi, . . . 

2) A set of clients or receivers R C {[x, H); x e X, H C 
X\{x}}. 

Here, X represents the set of messages available at the sender. 
Each message Xi belong to a certain alphabet H^. A client is 
represented by a pair (x^H), where x e X is the message 
required by the client, and C X is set of messages available 
to the client as side information. Note that in our model each 
client requests exactly one message. This does not incur any 
loss of generality as any client that requests multiple messages 
can be substituted by several clients that require a single 
different message and have the same side information as the 
original one. 

Each message Xi can be divided into n packets, and 
we write x = {xn^ . . . ^Xin) G S^. We denote by ^ = 

(xii,...,Xi72,..., Xj^i •)••••, ^kn) ^ ^ 

Definition 1 (Index Code): An (n, index code for 
X(X, R) is a function / : — > S^, for a certain 
integer c, satisfying that for each client p = {x^H) G R, 
there exists a function ijjp : S^+^l^l — > ^u^h that 

We refer to c as the length of the index code. Define i{n^q) 
to be the smallest integer c such that the above condition holds 
for the given alphabet size g-ary and block length n. If the 
index code satisfies c = ^(n, g), it is said to be optimal. 

We refer to ipp as the decoding function for client p. With 
a linear index code, the alphabet S is a field and the functions 
/ and ipp are linear in variables Xij. If n = 1 the index code 
is called a scalar code, and for n > 1, it is called a vector or 
block code. Note that in our model a packet can be requested 
by several clients. This is a slightly more general model than 
that considered in references [7] and [9] where it was assumed 
that each message can only be requested by a single client. 

Given n and q, the index coding problem consists of finding 
an optimal index code for an index coding instance, for a 
given instance R) of the index coding problem, we define 
by X{n^q) = £{n^q)/n the transmission rate of the optimal 
solution over an alphabet of size q. We also denote by A*(n, q) 
the minimum rate achieved by a vector linear solution of 
block length n over the finite field ¥q of q elements. We are 
interested in the behavior of A and A* as functions of n and 



Let p{X) be the maximum of the total number of mes- 
sages requested by a set of clients with identical side in- 
formation, i.e., p{I) = maxycx (xi^Y) ^ R}\. Then, 
it is easy to verify that the optimal rate X{n^q) is lower 
bounded by p{T), independently of the values of n and q. 
To see this, let F* = argmaxycx {xi^Y) G R}\ and 
W = {xi; G R} and remove all clients that do not 

have the set Y* as side information. We note that, since 
F* n = 0, the minimum transmission rate of the resulting 
instance is equal to \W\ = p{T). Since the rate of the 
resulting instance is lower or equal to \{n^q) it holds that 
\(n,q) > p(X). 

Definition 2: Let R) be an instance of the index cod- 
ing problem. Then, an index code for X(X, R) that achieves 
A(n, q) = p{X) is referred to as a perfect index code. 

Note that the index code for the example in Figure [T] is not 
perfect, since, in that case, A = 2 and p = l. 

B. Network Coding 

Let G{V.,E) be a directed acyclic graph with vertex set V 
and edge sti E (Z V x V . For each edge e{u.,v) G E, we 
define the in-degree of e to be the in-degree of its tail node 

and its out-degree to be the out-degree of its head node v. 
Furthermore, we define V{e) to be the set of the parent edges 
of e, i.e., V{e{u, v)) = {{w, u); {w, u) G E)}. Let 5 C ^ be 
the subset of edges in E of zero in-degree and \ti D (Z E 
be the subset of edges of zero out-degree. We refer to edges 
in S as input edges, and those in D as output edges. Also, 
we define m = |^| to be the total number of edges, k = \S\ 
be the total number of input edges, and d = \D\ be the total 
number of output edges. Moreover, we assume that the edges 
in E are indexed from 1 to m such that S = {ei, . . . , e/c} and 
D = {em-d+i, • • • , em}- 

We model a coding network by a pair JV{G{V^ E),S) 
formed by a graph G{V,E) and an onto function S : D — > S 
from the set of output edges to the set of input edges. We 
assume that the tail node of each input edge e^, i = 1, . . . , 
holds message Xi, also denoted as x{ei). Each message Xi 
belongs to a certain alphabet H"^, for a positive integer n. 
The edges of the graph represent communication links of 
unit capacity, i.e., each link can transmit one message per 
channel use. The function S specifies for each output edge 
Ci, i = m — + 1, . . . , m, the source message x{S{ei)) 
required by its head node. We refer to S as the demand 
function. In a vector solution, each message Xi be divided 
into n packets (x^i, . . . ,Xin) G H"^. We also denote by ^ = 
(xii, . . . , xin^ . . . , Xkij . . . , Xkn) ^ the concatenation of 
all packets at the input edges. 

Definition 3 (Network Code): A g-ary network code of 
block length n, or an (n, network code, for the network 
^/{G{V,E),5) is a collection 



C = {fe = {fl...j:);eeEJl:^'' 



of functions, called global encoding functions, indexed by 
the edges of G, that satisfy, for all ^ G S"^^, the following 
conditions: 

(Nl) /e,(0=^i, forz = l,...,fc; 
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Fig. 2. An instance to the network coding problem equivalent to the instance 
of the index coding problem depicted in Figure^ 

(N2) /e, (0 = x{S{ei)), for i = m - d + 1, . . . , m; 
(N3) For each e = {u,v) G E \ S with V{e) = 
{ei, . . . , ep^}, there exists a function (/)e : S^^"^ — > S^, 
referred to as the local encoding function of e, such 
that /e(0 = 0e(/ei (0^ • • • ^ /e^, (0)' whcrc Pe is the in- 
degree of e, and V{e) is the set of parent edges of e. 
When n = 1, the network code is referred to as a scalar 
network code. Otherwise, when n > 1, it is called a vector or 
a /?/6>c^ network code. We are mostly interested here in linear 
network codes where S is a finite field F, and all the global 
and local encoding functions are linear functions of the packets 
Xij. Note that, a scalar linear network code over GF{p^) will 
naturally induce a vector linear network code of block length 
n over GF(jp)\ however, the converse is not necessarily true. 

III. Connection to Network Coding 

We first note that network coding is a more general problem 
than index coding. Indeed, for every instance of the Index 
Coding problem and a given integer c, there exists a corre- 
sponding instance of the network coding problem that has an 
(n, q) network code solution if and only if there exists an (n, q) 
index code of length c.n. For example. Figure |2] depicts the 
instance of the network coding problem that corresponds to the 
instance of the index coding problem presented in Figure [T] 
where the broadcast channel is represented by c "bottleneck" 
edges. 

In this section we present a reduction from the network 
coding problem to the index coding problem showing that 
two problems are equivalent. Specifically, for each instance 
M{G{V^ E)^ 5) of the network coding problem, we construct a 
corresponding instance Tj\f{Y-, R) of the index coding problem, 
such that Xj\f has an (n, perfect linear index code if and only 
if there exists an (n, q) linear network for M. 

Definition 4: Let J\f{G{V^ E)^ 5) be an instance of the Net- 
work Coding problem. We form an instance Tj\f(X^ R) of the 
Index Coding problem as follows: 



1) The set of messages Y includes a message Ui for each 
edge ei e E and all the messages Xi G X, i.e., 
Y = {yi,...,ym}DX; 

2) The set of clients R = RiU ■ ■ - U defined as follows: 

a) Ri = {{xi,{yi});ei G S} 

b) R2 = {{yi,{xi})]ei e S} 

c) Rs = {{y^, {yj;ej e V{e,)}); e,eE\S} 

d) R4 = {{x{S{e,)),{y,});e,eD} 

e) R5 = {{yi,X);i = l,...,m} 

It is easy to verify that instance T^f{Y^ R) satisfies iJi{Tj^) = 
m. 

Theorem 5: Let M{G{V^ E)^ X^5) be an instance of the 
network coding problem, and let Tj\f(X-, R) be the correspond- 
ing instance of the index coding problem, as defined above. 
Then, there exists an (n, q) perfect linear index code for Xj\f, 
if and only if, there exists a linear {n^q) network code for A/". 

Proof: Suppose there is a linear {n^q) network code 
C = {/e(X); /e : (F^)^ ^ F^, e G E} for AT over the finite 
field ¥q of size q for some integer n. 

Define g : (F^)^+^ ^ (F^)^ such that 
VZ = G (F-r+^^(Z) = 

{gi{Z),...,gm{Z)) where g,{Z) = y^ ^ /e,(X), 
z = 1, . . . , m. More specifically, we have 

gi{Z) =yi^Xi i = 1, . . . 

gi{Z) = yi ^ fe^{X) i = A: + 1, . . . , m - 

gi{Z) = yi -\- x{5{ei)) z = m — d + 1, . . . , m. 

Next, we show that g{Z) is in fact an index code for by 
proving the existence of the decoding functions. We consider 
the following five cases: 

1) Wp = {xi,{yi}) e Ri.iljp = gi{Z) - yi, 

2) Vp=(y„{xJ)Gi?2,V^p = ^i(Z)-^^, 

3) Vp = G R3, since C is a linear 
network code for A/", there exists a linear function 

such that /e,(X) = (/)e,(/e,^(X),...,/e.^(X)). ThuS, 

i^P = 9i{Z) -(})eX9iAZ) -yi^,...,gi^{Z) -yi^), 

4) Vp = (x(^(e,)), {yi}) G i?4,e^ eD,^p = g,{Z) - yi, 

5) Vp=(y„X)Gi?5,V^p=^^(Z)-/e.(X). 

To prove the converse, we assume that g : — > 
(F^)^ is a perfect linear {n^q) index code for Ij\f over the 
field ¥q. Again, we denote Z = (xi, . . . , x/c, . . . , ym) G 
(^n)m+/c^ and g{Z) = (^i(Z), . . . , ^^(Z)), x,,y, and 
gi{Z) G F^. We also write 

k m 

for z = l,...,m, and Aij^Bij G M^^{n^n), where 
MFg(n, n) is the set of n x n matrices with elements in F^. 

The functions ijjp exist for all p G if and only if 
the matrix M = [Bij] G M^^{nm^nm), which has the 
matrix Bij as a block submatrix in the (i, j)th position, is 
invertible. Define h : (F^)^+^ — > (F^)^, such that h{Z) = 
g{Z)M-^,yZ G (F^)^+^. So, we obtain 

k 

hi{Z) = yi^^XjCij.i = l,...,m, 
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where Cij G MFg(n,n). We note that h{Z) is a vahd index 
code for X^. In fact, Vp = (x, H) e R with (^j^e//) = 

X, \lj'p{h,{z)^^H) = iJp{hM,{z)^eH)) is a vahd decoding 
function corresponding to the cHent p and the index code 

HZ). 

For all p e RiU R4, ifj'p exists iff for z = 1, . . . , /c, m — 
(i + 1, . . . , m, j = 1 ... A: and j ^ i ii holds that Cij = [0] G 
Mf^ (n, n) and C^i is invertible, where [0] denotes the all zeros 
matrix. This implies that 

hi{Z) = yi^ XiCii, i = l,...,k 

k 

hi{Z) = Hi -\- ^ XjCij^i = /c + 1, . . . , m — d (1) 

hi{Z) = Ui^ x{5{ei))Cii, z = m- (i+l,...,m 

Next, we define the functions fe- : (F^)^ — > W^.ei e E SiS 
follows: 

1) fe, {X) = Xi, for z = 1, . . . , /c 

2) fe, {X) = Yl)=i ^j^ij^ for i = k^l,...,m-d 

3) fei{X) = x{6{ei)), for i = m - d + 1, . . . ,m. 

We will show that C = {fa'^^i G ^} is a linear {n^q) 
network code for M by proving that condition N3 holds. 

Let Ci be an edge in £^ \ with the set of parent edges 
V{ei) = {ci^,. . . , Ci^} . We denote by li = {h, . . . , ip} and 
Pi = iUii {Uii I'-'iUip}) ^ Then, there is a linear function 
ip'p. such that Vi \lj'p^{hi, . . . .hm.Vn, • • • ,yi^)^ Hence, there 
exist matrices Tij^T[^ G M^^{n^n) such that 

m 

yi = Y^ ^j^ij + vc^^L (2) 

Substituting the expressions of the hj's given by Eq. ([T]) in 
Eq. ([2]), we get that the following: 

• Tii is the identity matrix, 

• = -Tia^a e 

• = [0] Vj ^ /, U {i}. 
Therefore, we obtain 

fe. = -Y.fe^T,o.^ye,eE\S, 

aeli 

and C is a feasible network code for A/". ■ 
Lemma 6: Let JV{G{V^ E)^S) be an instance of the Net- 
work Coding problem, and let Tj^{Y^ R) be the corresponding 
index problem. If there is an (n, q) network code (not neces- 
sarily linear) for AT, then there is a perfect (n, q) index code 
for Ij\f. 

Proof: Suppose there is an (n, network code C = 
{/e(X);/e : (5]^)^ ^ S^,e G E} for Af over the q-ary 
alphabet S. Without loss of generality, we assume that S = 
{0,l,...,g-l}. 

Define g : (5]^)^+^ ^ (S^)^ such that 

{g,{Z),...,gm{Z)) with 

^i(Z) = fe,{X), i = l,...,m 

where "+" designates addition in GF{q)^. Then, the same 
argument of the previous proof holds similarly here, and g is 
an index code for Xj\f. ■ 



IV. Connection To Matroid Theory 

A matroid MiY^r) is a couple formed by a set Y and a 
function r : 2^ — > No, where 2^ is the power set of Y and 
No is the set of non-negative integer numbers {0,1, 2,...}, 
satisfying the following three conditions: 
(Ml) r{A) < \A\ for VA C Y; 
(M2) r{A) < r{B) for WACBCY; 
(M3) r{A U 5) + r{A n B) < r{A) + r{B) for VA, B CY. 

The set F is called the ground set of the matroid M. The 
function r is called the rank function of the matroid. The rank 
tm of the matroid A/1 is defined as tm = r{Y). 

We refer to 5 C F as an independent set if r{B) = 
otherwise, it is referred to as a dependent set. A maximal 
independent set is referred to as a basis. It can be shown that 
all bases in a matroid have the same number of elements. In 
fact, for any basis B, it holds that r{B) = \B\ = vm- A 
minimal dependent subset C C F is referred to as a circuit. 
For each element c of C it holds that r{C \ {c}) = |C| — 1 = 
r{C). We define "^{M) to be the set of all the bases of the 
matroid M, and (t{M) be the set of all circuits of M. 

Matroid theory is a well studied topic in discrete mathemat- 
ics. References [22] and [23] provide a comprehensive discus- 
sion on this subject. Linear and multilinear representations of 
matroids over finite fields are major topics in matroid theory 
(see [22, Chapter 6], [24], and [25]). 

Definition 7: Let Y = be a set whose el- 

ements are indexed by the integers from 1 to m. For any 
collection of m matrices Mi, . . . , Mm G MF(n, /c), and any 
subset / = {yi-^ , . . . , ?/^^} C F, with ii < • • • < i^, define 

Mi = [M,,\...\M,,]eM^{n,8k). 

That is the matrix M/ obtained by concatenating matrices 
Mi^ , . . . , from left to right in the increasing order of the 
indices zi, . . . , z^^. 

Definition 8: Let A4(F, r) be a matroid of rank tm = k 
on the ground set F = . . . , y^}. The matroid M is said 
to have a multilinear representation of dimension n, or an n- 
linear representation, over a field F, if there exist matrices 
Ml, . . . , Mm G MF(A:n, n) such that, V/ C F, 

rank(M/) = n • r(/). (3) 
Linear representation corresponding to the case of n = 1 is 
the most studied case in matroid theory, see for example [22, 
Chapter 6]. Multilinear representation is a generalization of 
this concept from vectors to vector spaces, and was discussed 
in [24], [25]. 

Example 9: The uniform matroid [/2,3 is defined on a 
ground set F = {^1,^2,^3} of three elements, such that 
V/ C F and |/| < 2,r(/) = |/|, and r(F) = 2. It is easy 
to verify that matrices Mi = [0 1]^,M2 = [0 1]^,M3 = 
[1 1]^ form a linear representation of of dimension 1 
over any field. This will automatically induce a multi-linear 
representation of dimension 2, for instance, of [/2,3 over any 
field: 
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Fig. 3. A graphical representation of the non-Pappus matroid of rank 3 [22, 
p.43]. Cycles are represented by straight lines. 



Example 10: The non-Pappus matroid (see e.g., [22, §1.5]) 
Mnpiy^"^) is defined over a ground set Y = . . . , ^/g} 
and can be represented geometrically as shown in Figure 
[S] Let Yo = {{1,2, 3}, {1,5, 7}, {3, 5, 9}, {2, 4, 7}, {4, 5, 6}, 
{2, 6, 9}, {1,6, 8}, {3, 4, 8}}. The rank function of the non- 
Pappus matroid is r(/) = min(|/|,3) V/ G 2^ \Yo, and 
r(/) = 2 for V/ G Yq. Note that Yq is the set of circuits of 
the non-Pappus matroid. 

It is known from Pappus theorem [22, p. 173], that the non- 
Pappus matroid is not linearly representable over any field. 
However, it was shown in [24] and [25], that it has a 2-linear 
representation over GF(3), given below by the following 6x2 
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(4) 

In the rest of this section, we present a reduction from the 
matroid representation problem to the index coding problem. 

Definition 11: Given a matroid M.(Y^r) of rank k over 
ground set Y = {?/i, . . . , ?/^}, we define the corresponding 
Index Coding problem Im{Z^ R) as follows: 

1) Z = ruX, where X = {xi,...,x/e}, 

2) R = R1UR2UR3 where 

a) Ri = {{xi, B)- B e ^{M), i = l,...,k} 

b) R2 = {{y,C\{y}y,Ce(t{M),yeC} 

c) Rs = {{yi,X);i = l,...,m} 
Note that /j^{Tm) = ^• 

Theorem 12: Let M{Y^r) be a matroid on the set 
Y = {yi^ . . . ^ym}, and Xm{Z^ R) be its corresponding Index 
Coding problem. Then, the matroid M has an n-linear rep- 
resentation over ¥q if and only if there exists a perfect linear 
(n^q) index code for Xm- 

Proof: First, we assume that in Im{Z^ R) all messages 
are split into n packets, and we write yi = {yn^ . . . ^yin), Xi = 

(x^i , . . . , X^n) G IFg , ^ = (Xii , . . . , . . . , Xj^i ')••'') '^kn) ^ 

F^^, and X = (^ll, • • • , ^In, • • • , ^ml, • • • , ^mn, 



Let Ml, . . . , Mra G MFg(/cn, n) be an n-linear represen- 
tation of the matroid M. Consider the following linear map 

/(x) = (/l(x),...,/m(x)) 

/z(x)=^z+^M,GF^,z = l,...,m. 

We claim that / is a perfect {n^q) linear index code for Xm • 
To this end, we show the existence of the decoding functions 
of condition (II) for all the clients in R: 

1) Fix a basis B = {yi^, . . . ,yi^} G ^{M), with n < 
i2 < ■ ' ' < ik, and let pi — (x^, B) G i = 1, . . . , /c. 
By Eq. ^ rank(M^) = /cn, hence the kn x kn matrix 
Mb is invertible. Thus, the corresponding decoding 
functions can be written as 

= [fii -yii\"'\fik -VikPi^ 

where the Ui's are the kn x n the block matrices that 
form M^^ in the following way: 

[U,\...\Uk]=M-\ 

2) Let C = {yi,,. . .,yij e €{M), with n < ^2 < • • • < 
ic, and p = {yi, , C) G R2, with C' = C — yi,. We have 
rank(Mc/) = rank(Mc) by the definition of matroid 
cycles. Therefore, there is a matrix T G Mf^ {cn — n, n), 
such that. Mi, = M^T. Now, note that 

[fi2 -yi2\'"\fic - Vic] = iMc'- 

Therefore, the corresponding decoding function is 

i^p = fh - [fi2 - 1 • • • l/ic - yio]T- 

3) For all p={y,,X)e Rs^ ^,{f, i) = h - ^M,. 
Since this index code satisfies the lower bound p{Xm) = 
it is a perfect index code. 

Now, suppose that /(x) = (/i(x), • • • , /m(x)), Mx) ^ 
is a perfect (n, q) linear index code for Xm • We will show that 
this will induce an n-linear representation of the matroid A4 
over ¥q. 

Due to the clients in Rs, we can use the same reasoning 
used in the proof of the converse of Theorem [5] and assume 
that the functions fi{x)^^ = 1, • • • , m, have the following form 



(5) 



where the A^'s are kn x n matrices over F^. We claim that 
these matrices form an n-linear representation of M over ¥q. 
To prove this, it suffices to show that the matrices Ai's satisfy 
Eq. ([3]) for all the bases and cycles of M. 

Let 5 G ^{M) a basis. Then, by Eq. ([5]), the clients 
{xj,B),j = 1, . . . , /c, will be able to decode their required 
messages iff is invertible. Therefore, rank(AB) = nk = 
nr{B). 

Let C G (t{M) a circuit. Pick yi, G C let C = C - yi,. 
We have r{C') = \C\ - 1 = |C'|,i.e., C is an independent 
set of the matroid, and there is a basis B of M such that 
C C B (by the independence augmentation axiom [22, 
chap. 1]). Thus, from the previous discussion, Ac has full 
rank, i.e. rank(74c") = {\C\ — l)n. Now consider the client 
p = {yi^jC) G R2, the existence of the corresponding 
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matroid Mnp does not admit any linear representation, by 



Theorem 12 there is, also, no perfect scalar linear index code 
for ^Mnp- Nevertheless, the multilinear representation of the 
non-Pappus matroid over GF{3) described in Example 10 
induces a perfect (3, 2) vector linear index code for Tj^r^p- 

Corollary 15: For instance ^Mrip of the Index Coding 
problem it holds that A*(2, 2) < A*(l, 2). 

Proof: Follows directly from Theorem 12 




Xi Xs Xi X4 X2 Xs X2 



Fig. 4. The M-Netwrok M [26]. 



X4 



linear decoding function ipp implies that there exists a matrix 
T e MF(|C|n-n, n) such that Ai^ = Ac'T. So, rank(Ac) = 
rank(AcO = n{\C\ - 1) = nr{C). 



V. Properties of Index Codes 
A. Block Encoding 

Index coding, as previously noted, is related to the problem 
of zero-error source coding with side information, discussed by 
Witsenhausen in [5]. Two cases were studied there, depending 
on whether the transmitter knows the side information avail- 
able to the receiver or not. It was shown that in the former case 
the repeated scalar encoding is optimal, i.e., block encoding 
does not have any advantage over the scalar encoding. We will 
demonstrate in this section that this result does not always 
hold for the index coding problem, which can be seen as an 
extension of the point to point problem discussed in [5]. 

Let Ml be the M-network introduced in [26] and depicted in 
Figure]?] It was shown in [27] that this network does not have 
a scalar linear network code, but has a vector linear one of 
block length 2. Interestingly, such a vector linear solution does 
not require encoding and can be solved using a forwarding 
scheme. A more general result was proven in [27]: 

Theorem 13: The M-network has a linear network code of 
block length n if and only if n is even. 

Consider an instance I^i of the index coding problem 
corresponding to the M-network obtained by the construction 
of Definition |4] By theorem 13 I^i does not admit a perfect 
scalar linear index code. It has, however, a perfect linear 
index code of block length 2, over any field. Thus, l^i 
is an instance of the index coding problem where vector 
linear coding outperforms scalar linear one. This result can 
be summarized by the following corollary: 

Corollary 14: For JaTi , A*(2, 2) < A*(l,2). 

Proof: Follows directly from theorems [5] and [13] ■ 

Another similar instance of the index coding problem is 
^Mnp obtained by applying the construction of Definition 
11 to the non-Pappus matroid Mnp- Since the non-Pappus 



B. Linearity vs. Non-Linearity 

Linearity is a desired property for any code, including index 
codes. It was conjectured in [7] that binary scalar linear index 
codes are optimal, meaning that A* (1,2) = A (1,2) for all 
index coding instances. Lubetzky and Slav disproved this 
conjecture in [9] for the scalar linear case by providing, for 
any given number of messages k and field F^, a family of 
instances of the index coding problem with an increasing gap 
between A*(l,g') and A(l,g'). 

In this section, we show that vector linear codes are still 
suboptimal. In particular, we give an instance where non- 
linear index codes outperform vector linear codes for any 
choice of field and block length n. Our proof is based on 
the insufficiency of linear network codes result proved by 
Dougherty et al. [21]. Specifically, [21] showed that the 
network A/3, depicted in Figure [5] has the following property: 

Theorem 16: [21] The network A/3 does not have a linear 
network code, but has a (2,4) non-linear one. 

Let 2'^3 be the instance of the Index Coding problem that 
corresponds to A/3, constructed according to Definition |4] 
Theorem 16 implies that Xj\f^ does not have a perfect linear 
index code. However, by Lemma [6] a (2, 4) non-linear code of 
A/3 can be used to construct a (2,4) perfect non-linear index 
code for Xj^^ that satisfies A (2, 4) = /i(X). We summarize this 
result by the following corollary. 

Corollary 17: For the instance Xj\f^ of the Index Coding 
problem, it holds that A(2,4) = ii{Xm^) < A*(2,4). 

VI. Discussion 

A. Matroids and Networks 

Dougherty et al. [21], [27] used results on the representabil- 
ity of matroids for construction of network A/3 depicted in 
Figure [5] that served as a counter example to the conjecture 
of the sufficiency of linear network codes. They defined also 
the concept of a matroidal network, and presented a method 
for constructing networks from matroids [27, Section V.B]. 
Given a certain matroid, they design an instance to the network 
coding problem that forces the same independency relations 
of the matroid to exist in the set of edge messages. However, 
not all of the matroid dependency relations are reflected in this 
network. 

In this paper, we present a new construction that remedies 
this problem. Our construction is based on the reduction 



presented in Section [IV] It provides a stronger connection 
between matroids and network codes. Specifically, for a given 
matroid, we construct a network such that any multilinear 
representation of it will induce a linear network code for 
the obtained network over the same field, and vice versa. 
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xi X2 yi 2/2 2/3 2/4 2/9 




2/1, •• • ,2/9 a^i a:2 2/1 2/2 2/3 



Fig. 6. Part of the network resulting equivalent to the non-Pappus matroid resulting from the construction of Definition [Ts] 



network, and connect it to an output edge that demands 
source z. And, for each z' G H, add edge (5', Up), where 
s' G Vi is connected to an input edge carrying source 

z\ 

6) For each p e R, add edge {rij^ rip), for j = 1, . . . , m. 

Proposition 19: The matroid has an n-Hnear represen- 
tation over ¥q iff the network M{Tm) has an (n, Hnear 
network code. 

Proof: It can be easily seen that any (n, perfect Hnear 
index code for Xm will imply an {n^q) linear network code 
for M{Xm)^ and vice versa. The proof follows, then, directly 
from Theorem [121 ■ 
Figure |6] shows a sub-network of M(TMnp) resulting from 
the construction of Definition [18] applied to the non-Pappus 
matroid Mnp of Figure [3] Node ni represents the clients in 
the set i?3 of Tmup^ ^2 the basis {1, 2, 4} of the non-Pappus 
matroid, and 713,714,715 the cycle {1,2,3}. 




Fig. 5. The network A/3 of [21]. A/3 does not have a linear network code 
over any field, but has a non-linear one over a quaternary alphabet. 



This result will permit us to apply many important results on 
matroid linear representability to the network coding theory. 

Definition 18: Let M{Y^ r) be a matroid of rank k defined 
on the set F = {2/1, . . . , and Xm{Z^ R) the correspond- 
ing index coding problem as described in Definition 11 We 
associate to it the 6-partite network M{Tm) constructed as 
follows: 

1) F D U U Fa, where Vi = {^i, . . . , Sm+k}, V2 = 
{71;,..., 71;^}, and Vs = {<,..., n;;}. 
Connect each node Si^i = 1, . . . , /c, to an input edge 
carrying an information source Xi at its tail node, and 
each node Si^i = /c + 1, . . . , 771 + /c, to an input edge 
carrying an information source y^. 



2) 



0, for 



1, 



, m 



3) Add edges {si^rij) and {si 
and j = 1, . . . , 771. 

4) Add edges {n'^^n'-) for j = 1, . . . , 7ti. 

5) For each client p = {z^ H) e R, add a vertex 7ip to the 



B. Special case 

Determining the capacity and design of efficient solutions 
for the general network coding problem is a long-standing 
open problem. In particular, it is currently not known whether 
the general problem is solvable in polynomial time, NP- 
hard, or even undecidable [28]. Reference [29] proved that 
determining the scalar liner capacity is NP-hard, however, it 
is not known whether this result holds fro the vector-linear or 
general network codes. A important problem in this context is 
whether the hardness of the general network coding problem 
carries over to special cases of practical interest. For example, 
references [26], [30] shat that restricting the network coding 
problem to multiple unicast connections does not result in a 
loss of generality. In particular, [30] presents a procedure that 
transform an instance of the network coding problem into an 
equivalent instance with multiple unicast connections. 

In this section we describe a new class of the network 
coding problems which captures many important properties 
of the general problem. The instances that belong to this 
class have a simple structure with a 4-partite underlying 



9 



xi X2 yi 2/2 y3 




Fig. 7. (a) The butterfly network, (b) the equivalent ICN network. A 
corresponding network code consists of the encoding functions yi , 2/2 + 
3^2, 2/3 + + X2 on the bottleneck edge. 



communication graph. In particular, the class is defined by 

{JV{X^' )]M' is a communication network} , 

where Xj\f> is the instance of the Index Coding problem 
constructed as per Definition [4] and J\f{Tj\f') is the instance of 
the network coding problem obtained through the construction 
depicted on Figure [2] Figure [TJb) shows an example of 
an instance of a coding network that belongs to this class. 
Theorem |5] implies that for any coding network M' it holds 
that M' has an (n, linear network code if and only if such 
a solution exists for AT* = M{X^f' ) . 

VII. Conclusion 

This paper focuses on the index coding problem and its 
relation to the network coding problem and matroid theory. 
First, we presented a reduction that maps an instance M of 
the network coding problem to an instance Xj^ of the index 
coding problem such that M has a vector linear solution if and 
only if there is a perfect index code for Xj^. Our reduction 
implies that many important results on the network coding 
problem carry over to the index coding problem. In particular, 
using the M-network described in [26], we show that vector 
linear codes outperform scalar ones. In addition, by using the 
results of Dougherty et al. in [21] we show that non-linear 
codes outperform vector linear codes. 

Next, we present a reduction that maps an instance of the 
matroid representation problem to the instance of the Index 
coding problem. In particular, for any given matroid M we 
construct an instance of the index coding problem Xm^ such 
that M has a multilinear representation if and only if Xm has a 
vector linear solution over the same field. Using the properties 
of the non-Pappus matroid, we gave another example where 
vector linear codes outperform than scalar linear ones. 

Our results imply that there exists a strong connection 
between network coding and matroids. In addition, our results 
yield a family of coding networks that have a simple structure, 
but still capture many important properties of the general 
network coding problem. 
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